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ALMOST PRIME TRIPLES AND CHEN’S THEOREM 


ROGER HEATH-BROWN AND XIANNAN LI 


Abstract. We show that there are infinitely many primes p such that not only does 
p + 2 have at most two prime factors, but p + 6 also has a bounded number of prime 
divisors. This refines the well known result of Chen [3. 


1. Introduction 

The twin prime conjecture states that there are infinitely many primes p such that p + 2 
is also prime. Although the conjecture has resisted our efforts, there has been spectacular 
partial progress. One well known result is Chen’s theorem [2J that there are infinitely 
many primes such that p + 2 has at most two prime factors. In a different direction, 
building on the work of Goldston, Pintz, and Yildirnn [5], it has recently been shown 
by Zhang ECU that there are bounded gaps between consecutive primes infinitely often. 
The numerical result has been improved in the works of the Polymath8 project [9] and 
Maynard [7], and the bounded gaps result has also been extended to prime tuples by 
Maynard [7] and Tao (unpublished). 

The twin prime conjecture is a special case of the Hardy-Littlewood conjecture, which 
postulates asymptotics for prime tuples in general. An example is that one expects that 
the number of primes p < x such that p + 2 and p + 6 are simultaneously prime should 
be asymptotic to 

log x 

for a certain positive constant C (given by fj32jl ). In this direction, it has been proven that 
there are infinitely many natural numbers n such that n(n + 2)(n + 6) is almost prime - 
that is, n(n + 2)(n + 6) has at most r prime factors, for some finite r. More specifically, 
Porter m proved this statement for r — 8 and this was improved by Maynard [8] to 
r = 7. 

We are interested in proving an analogue of Chen’s theorem for prime tuples. More 
precisely, we show that there are infinitely many primes p such that p + 2 has at most 
two prime factors, and p + 6 has at most r prime factors for some finite r. 


Theorem 1. Let vr 12 , r (x) denote the number of primes p < x such that p + 2 has at most 
two prime factors and p + 6 has at most r prime factors. Then 

x 


( 1 ) 

for r = 76. 


ni,2, r ( x ) > 


log 3 x 


Our basic philosophy, which the proof will illustrate, is the following. Suppose one 
has polynomials fi(x ),..., f k +i(x) and positive integers ry, ..., r k . Then, if the weighted 
sieve can prove that 

/i(n) = P ri ,..., f k (n) = P rk 
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for infinitely many integers n, then one should be able to modify the argument to show 
the existence of a positive integer r k+ \ such that 

/i(n) = P n ,..., f k+1 (n) = P rk+1 
for infinitely many integers n. 

Our approach uses the weighted sieve which appeared in Chen’s original work, as well 
as the vector sieve of Briidern and Fouvry pQ. We will also use a Selberg upper bound 
sieve of “mixed dimension”. The value of r in our theorem could be improved by using 
a more elaborate weighted sieve, but we will not pursue this. 

Acknowledgement. We would like the thank the referee for a very careful reading 
of the manuscript, with special attention to the numerical work described at the end 
of Section 4. In particular, we are indebted to the referee for the neat calculation of 
the values of B( 1, (4# 2 ) -1 ) and J(9i,9 2 ,9). This work was supported by EPSRC grant 
EP/K021132X/1. 


2. The basic setup 

In the sequel, p and pi shall always denote primes. Let e > 0 be a small positive 
constant, and let x £ < £ 2 < £i < x 1//3 be parameters to be decided in due course. We will 
work with the set 

A = {p + 2 : x 1/3 < p < x - 6, (p + 6, P(£ 2 )) = 1}, 

where 

P(w) = Y[p 

p<w 

as usual. 

The basic idea in Chen’s argument is to consider the expression 

(2) = X S(A.;fi)-iiV 0 , 

where 

No = #{piP2P3 e A : fi < Pi < x 1/3 <p 2 < p 3 }- 
One then has an inequality of the form 

(3) S, < #A^ + #A^ + ^#A^ 2 \ 

where 

A (0) = {n E A : p\\n for some Pi > ^i}, 

A (1) = {n G A : n prime, or n = p\p 2 with p 2 > pi > x 1//3 }, 

and 

A^ = {n G A : n — pip 2 with p 2 > x 1//3 > p± > ^i}. 

The bound which the reader may easily verify, is closely related to the inequality 
used by Halberstam and Richert |6J Chapter 11, (2.1)], for example. 

One immediately has ftA^ x/£i, which will be sufficiently small for our purposes. 
Moreover one can see that if n G U A ^ then n = p + 2 with n = P 2 and n + 4 = P r , 
where r = [(loga:)/(log^ 2 )]. We therefore obtain a result of the type given in our theorem 
provided that we can give a suitable positive lower bound for Si. This can be achieved 
by using the vector sieve of Briidern and Fouvry [1] in place of the usual upper and lower 
bound sieves. 
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There are a number of methods to try to improve the value of r obtained by this naive 
approach. We choose to include a simple weighted sieve in order to eliminate those triples 
(p, p + 2,p + 6 ) for which p + 6 has many prime factors. (The reader will observe that 
one could do better by incorporating more elaborate weights into ([ 2 ]).) 

We proceed to define the sets 

={p + 6: p + 2eA®}, (i — 1 , 2 ) 


and a weight function 


(4) 


w p = 


log p 
log y 


where y = x 1 ^ for some positive constant v to be decided in due course. At this stage 
we insist only that £2 < V < x. Since any element of B W is coprime to P(£ 2 ), and since 
w p < 0 for p > y, we now have 


£ w p # 0 « > £ 


£2 <p<y 


2<p<x 


1 


p\b 


beBtt) 

> J2 M 6 ) 


beBW 


log y 


logx 


&eBW 


Here, as usual, 0 o(b) denotes the number of distinct prime factors of b. It then follows 
from d3J) that if A is any positive constant then 


Si-A £ W V (#»?’ + 

£ 2 <p<y ^ 

< 0(x/£i)+ I +2 ) (1 + ~ Aw(6)) 

\feeB( 1 ) feeB( 2 ) / 

< 0{x/£ 1 ) + (1 + A v)#{b e B w U B {2) : 00 (b) < A" 1 + v} 

(5) < 0(x/^ 2 ) + (1 + A v)#{b G £> (1) U B 1 ' 2 ' 1 : 00 (b) < A~ x + v, b square-free}. 

Here we use the observation the the number of elements of B^ which are not square-free 
must be 0(x/£ 2)5 since any such element is coprime to P(£ 2 ) by definition. 

We therefore seek to show that 


( 6 ) 


Si- 


* w p (# B p } + ^ { c + ol 1 )} 

£2 <p<y ' 


x 

(logo ;) 3 


for some positive constant c. Substituting our expression for S\ from ([2]), we see that we 
must bound S(A] £ 1 ) from below, which we accomplish using a combination of the linear 
sieve with the vector sieve. We require upper bounds for the rest of the terms. Here, we 
use two distinct methods. For 

£ SVC; 6) 

£i<p<z 1/3 
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we will use the vector sieve for some ranges of p and the Selberg sieve for other ranges of 
p. For the remaining terms it turns out to be more efficient to apply the Selberg sieve. 
Our application has the novel feature that the sieving dimension changes from 2 (for 
primes p < £ 2 ) to 1 (for larger primes) part way through the range. Naturally, for the 
term Nq we first apply Chen’s famous “reversal of roles” trick before applying the upper 
bound sieve. 


3. Sieving tools 


3.1. The linear sieve. In the Rosser-Iwaniec linear sieve one has a real parameter I) > 2 
and constructs coefficients X ± (d) supported on the positive integers d < D such that 

A ± (d) = p(d) or 0, for all d < D 

and 

a-(«*)<!>(<*)<£ 

d\n d\n d\n 

for all positive integers n\P(z) for some parameter z. Suppose we have a multiplicative 
function h(d) G [0,1) such that 

p) n 

w<p<z 

for z > w > 2, for some parameter L. 

Iwaniec [3], we have 

(8) E A+(</)/!(</) < {F( S ) + 0 I ((logB)- 1 '' 6 )}\/(i,ft) (s > 1) 

dL\P{z) 

and 

A ~(d)h(d) > {f{s) + 0 L ((log D )- 1 ^)} V(z, h) (s > 2) 

d\P(z) 

where F(s) and f(s) are the standard upper and lower bound functions for the linear 
sieve, with s = (logiA)/(log^), and 

n*, ft) = n (!-%))■ 

p<z 

Moreover one sees from jlj (6.31)—(6.34)] that 

( 9 ) E A ~(d)h(d) < V(z, h) < A + (d)h(d). 

d\P(z ) d\P(z) 


L log Z 


logw 


1 + 


logw 


Then, by Theorem 11.12 of Friedlander and 




3.2. The Fundamental Lemma sieve. Let U be a set of positive integers, possibly 
with multiplicities, and suppose that 

(10) jfU d = h*(d)Y + r(d) 

for some multiplicative function h*(d) G [0,1). We assume for simplicity that 

(11) h*('P) < C 0 P~\ 


for some constant Co > 2. Then 

II (! -V(p))" 1 < K 

w<p<z 


( log^ 

\ log w 


K, 
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for z > w > 2 for appropriate constants K and k depending only on Cq. Hence Corollary 
6.10 of Friedlander and Iwaniec [1] applies, and yields 

S(W; *) = {! + Oc,(e-)}YV(z, h') + 0(J2 k(rf)l), 

d<z 3 


for z > 2 and s > 1. 

An inspection of the proof makes it clear that one only uses (TTOT) for values d \ P(z). 


3.3. The vector sieve. Let W be a finite subset of N 2 . Suppose that Zi,Z 2 > 2 with 

log z 1 x log z 2 

and write z = (zi, z 2 ). For d = (d\, d 2 ) and n = (ni, 712), we write d|n to mean that di\rii 
for 1 < 7 < 2. Define as usual 

W d = {n E W : d|n}, 

and 

5(W;z) = {(m,n) E W : (P(zi),m) = (P(z 2 ),n) = 1}. 

Suppose that 

#W d = h{d)X + r(d) 

for some multiplicative function h(d) E (0,1] such that h(p, 1) + h(l,p) — 1 < h(p,p ) < 
h(p, 1) + h(l,p) for all primes p and 

(12) h(p, l),h(l,p) < Cip -1 , and h(p,p)<C 1 p~ 2 

for some constant C\ > 2. Then using the vector sieve and the linear sieve, we will derive 
both upper and lower bounds for 5'(W; z). 

For i = 1,2, let A f and A“ denote the coefficients of the upper and lower bound linear 
sieves of level 

A = 2,-S (i = 1,2) 

where 1 < Sj < 1. Further, let 6 — p * 1, 8 f = \f * 1 and = A” * 1. Note that 
<S< 5j + , and that 

(13) S(m)S(n) < 5± ( 771)62 (n) 
and 


(14) 6 (m) 6 (n) > 5 1 ( 171)62 (n) + ( n ) — ( 712)62 (n), 

for any natural numbers m and n. 

In applying the vector sieve we will want to replace h(d) by hi(di)h- 2 (d 2 ), where 

h\(d) — h(d, 1), and h, 2 (d) = h(l,d). 

There is no difficulty when di and d% are coprime, but there are potential problems when 
they share a common factor. We circumvent this issue by using a preliminary application 
of the Fundamental Lemma sieve. Suppose we are given zo > 2 and positive integers 
di, d 2 coprime to P(zq). Let U = U(d) be the set of products mn as (m, n) runs over Wd, 
the values mn being counted according to multiplicity. Then if d | P(zq) we see using the 
multiplicativity of h that (TTTTT) holds with Y = h(d)X, 

h*(d)= ^ ^(eie 3 , e 2 e 3 )p(e 3 ) 

d=e \e2G3 

and 

r(d)= J2 r(d 1 e 1 e 3: d 2 e 2 e 3 )fi(e 3 ). 

d=e ie2C3 
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In particular h*(p) = h(p, 1) + h(l,p) — h(p,p ) G [0,1), and (TTIT) holds with suitable 
Co = 2C\. The Fundamental Lemma sieve therefore shows that 

S(Wd, (zo, 2o)) = S(U(d), Zo) 

(15) = h(d)XV(z 0 ,h*) + 0(h(d)Xe~ s ) + O l ^ |r(dieie 3 , d 2 e 2 e 3 )| 

\eie 2 e3 <2g 

We can now apply the upper bound vector sieve. Suppose that z\,z 2 > Zo, and define 

p{z o,-) = n p - 

Z 0 <P<2 



Let 

W* = {(m,n) G W : (mn,P(zo)) = 1}. 

Then according to (1T31) we have 

S(W‘,z) = ^2 S((m,P(zo,zi)) 6 ((n,P(zo,Z 2 )) 

(m,n) £W* 

< '^2 ( ^2 I | ^2 ^2(^2) 

(m,n)GW* ydi|(m,P(2 0 ,2i)) J \d 2 \(n,P(zo,z 2 )) 

= E E A+(<ii)A 2 + (d2)#W^. 

dl|P(2 0 ,2l) d 2 |P(2 0 ,2 2 ) 


However = S(\Vd, (^o,^o)); whence (TTB1) shows that 

S(W; z) < XV(zq, h*) E + 0(E 1 ) + 0(C 2 ), 


where 

E= E E A+(*)A+(d 2 )ft(d) 

dl|P(2 0 ,2l) d 2 |P(2 0 ,2 2 ) 

and the error terms are 

Si = W- V 5^ ft(d) 

c?i<Di d,2<D2 


and 

E 2= r2 (/l) r2 (/2)l' r (/l 5 /2)|- 

fi<Diz% fc<D 2 z° 

(We write r(...) for the divisor function as usual.) 

To estimate E we wish to replace h(d) by hi(di)h 2 (d 2 ). These are equal when d\ and 
d 2 are coprime. Otherwise we note that 


(16) h(di, d 2 ) < Cq 1 [dld2 \didi) 1 <C r(d 1 ) Co T(d 2 ) Co (did 2 ) 1 

by (H2D, and similarly hi(di) -C r(di) c ' 0 (i]” 1 and h 2 {d 2 ) r(d 2 ) c ' 0 (i^ 1 . Hence if di and d 2 

are not coprime then 


h(di, d 2 ) = h(di, l)h(l, d 2 ) + 0(T(di) c °T(d 2 ) c °(d 1 d 2 ) x ). 
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This latter case will only hold if there is a prime p > Zq which divides both d\ and d 2 . 
As a result we may deduce that 

E = £ £ X+(d 1 )X+(d 2 )h 1 (d 1 )h 2 (d 2 ) 

di\P(zo,zi) d,2\P(zo,Z2) 

+ 0 (5Z T ( pe 1 ) 6 '° r ( pe 2 ) c '° (p 2 eie 2 )~ 1 

\p>zo e\<D\/p e2<D 2 /p 

The leading term factors as 



^ A t(d 1 )h 1 (d 1 

d 1 \P(z 0 ,z 1 ) 



\t(d 2 )h 2 (d 2 


d 2 \P(z 0 ,z 2 ) 


and so if hi and h 2 satisfy the condition ([TJ) the inequalities (jHJ) and d9j) will lead to an 
upper bound 


{F( Si ) + O l ((log^)- 1/6 )} {F(s 2 ) +O l {i\ogz 2 )- 1/% )}ViV 2 , 


with 


The error term is 


Vi= ll (1 -hi(p)), (i — 1,2). 

ZO<P<Zi 


< 2 (log-i) 2Co (log- 2 ) 2C, ° < Z Q \logZiZ 2 ) 2l+C °. 

p>z 0 


Hence if we take 

= exp (y/log ZiZ 2 ) 

then we find that 

£ < F(s 1 )F(s 2 )V 1 V 2 {l + 0((logz 1 z 2 )- 1 ^)} 

on observing that V t (logziz^ -1 , by (JZJ). 

The error term E\ is easily handled using (ITT)]) . This produces 


Ei < Xe s (logziz 2 ) 2l+C ° < Xexp{-(logziz 2 ) 1/4 } 


on choosing 

s = y/log Zi_Z 2 . 

The bound dll]) shows that V(zo,h*) (log^o) ^ 0 , whence we may conclude that 
Ei < XV(z 0 ,h*)ViV 2 F(si)F(s 2 )(logziz 2 )- 1/6 . 

Moreover if we write D = DiD 2 we have 

E 2 ^ ^ T(did 2 ) 4 \r(di 1 d 2 )\ ^ r(did 2 ) 4 \r(di, d 2 )\ 

d 1 <D 1 (z 1 Z2) e d2<D2(ziZ2) e d\d 2 <D 1 + e 

for any fixed e > 0. 

We can therefore summarize our result as the first statement in the following proposi¬ 
tion. 
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Proposition 1. Suppose that h(d ) satisfies lilSfl and that hfid) and h 2 (d) both satisfy 
Assume further that D = z s fiz s fi with 1 < si,s 2 •C 1, and log^i x logz 2 as in our 
setup of the Vector Sieve. Then 

S{W-z) < XV(z 0 ,h*)V 1 V 2 F(s 1 )F(s 2 ){l + O({\ogD)- 1 / 6 )} 

+ O s l ^2 T(d 1 d 2 ) 4 \r(d 1 ,d 2 

\did2<0 1 + £ 


for any fixed £ > 0. Indeed if we write 

log D 
log Zi ’ 


O’i = 


0 = 1 , 2 ) 


we may replace F(si)F(s 2 ) by 

F(a i, cr 2 ) := inf{F(si)F(s 2 ) : si/ai + s 2 /a 2 = 1, Si > 1 (i = 1, 2)}. 
Similarly we have 

S(W-,z) > XV(z 0 ,h*)V 1 V 2 f{a u a 2 ){l + O((\ogD)- 1 / e )} 

+ O e I *22 r{d , id 2 ) i \r{di,d 2 

\did 2 <D 1 + E 

for any fixed £ > 0, where 

/(cr i,cr 2 ): = sup{/(si)F(s 2 ) + f(s 2 )F(s 1 ) - F(s 1 )F(s 2 ) : si/ai + s 2 /a 2 = 1, 

Si >2(i = l,2)}. 


The lower bound is proved along the same lines as the upper bound, using (1T4|) in place 
of (USD- I 11 handling the expression corresponding to £ we encounter a leading term of 
the form 

EfEj + £ 2 -E+ - S+E+, 

where 

sf= £ A t(d)h,(d). 

d\P(zo,Zi) 

In general, if 

(17) U t > Tfi > Li > 0 and £7 > Li for % — 1,2 

then 

£7£+ + £^£+-£+£+ > + L 2 £+ - £+£+ 

= L 1 L 2 -(£+-L 1 )(£+-L 2 ). 

Since Yfif — L 2 > 0 and U\ — L\ > 0 the above expression is at least 

L x L 2 - (fj x - L!)(£+ - L 2 ) > LiL 2 - (Ifi - L x ){U 2 - L 2 ) = L x U 2 + L 2 U\ - U X U 2 . 

To complete the proof of the proposition we apply the above inequality with 

Ui = {F(si) + 0 L ((log A) -1/6 ) } Vi and L t = {f( Si ) + O l ((log A)” 1/6 ) } V t , 


the required inequalities (fI7|) following from our description of the linear sieve, given in 
subsection 13.11 and noting that log Di x logH for i — 1,2. 




3.4. Selberg’s sieve. Let W be a set of positive integers and for each prime p < z let 
Q (p) be a set of residue classes modulo p. We would like to estimate 

S'fW; z) = #{u> 6 W : w ^ Ll(p) for all p < z} 

using Selberg’s sieve. 

Suppose that 

(18) #{n <G W : [n mod p\ E f l(p) if p \ d and p < z} = h(d)X + r d 

for some multiplicative function h(d) E [0,1). Then the usual analysis of Selberg’s sieve 
(see Halberstam and Richert Theorem 3.2], for example) shows that 

S(W-,z)<-X + J2 3 “ <d) \^ 

G( * ) d<: 2 


in which 


G(z) =^p 2 (d)g(d) 


d<z 


where g is the multiplicative function supported on squarefree numbers defined by 

, ^ Kp) 

9(p) = TZWy 

For our applications we will have 

('2 + 0 (p~ 1 ), p<z 2 , 

(19) ph(p) = < 1 + 0(p _1 ), z 2 <p< Z \, 

I 0, otherwise, 

where 2 < z 2 < Z\ < z. 

We now need to develop the asymptotics for G(z). 

Proposition 2. Preserve notation as above, and define 

log z 


Let 


be Dickman’s function, defined by 

p(s) = 

and 

( 20 ) 

for s > 1. Further let 


Si , 

log Zi 

p : (0, oo) —> M 
0, s < 0, 


1, 0 < s < 1, 

sp\s) = -p(s - 1) 

B : (0, oo) 2 —» M 


be defined by 

( 21 ) B{s 1 ,s 2 )- 1 = e- 2 ^ 



p(w 1 )p(w 2 )dw 1 dw 2 . 


{{w 1 .W2):wi/si+W2/s2<l} 


G(z) ~ B(s 1 ,s 2 )V(z,h) 


Then we have that 

( 22 ) 

if 1 < si, s 2 <C 1. 
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We delay the proof of this result until §5J Note that the level of distribution required will 
be D = z 2 , and that we have taken s t = (log.D)/(21ogZj), rather than the more normal 
Si = (log-D)/(logZj). It is easy to translate to the latter notation, but the definition of 
B(si,S 2 ) would look rather less natural. 

3.5. A version of the Bombieri-Vinogradov Theorem. In the previous sub-sections 
we introduced remainder terms which can be bounded in our applications by using a 
suitable version of the Bombieri-Vinogradov Theorem. We begin by stating a convenient 
result from the literature. 


Lemma 1. For zi, Z 2 , ■■■, z r > 2, define the set with multiplicities 

(23) P(zi,...,Z r ) = {p [r) =Pl...p r ■ Pi > Zi,...,p r > Z r }. 

Let 7iy(x; q, a) be the number of p^ G P(z\, ...,z r ) such thatp ^ = a (mod q) andp ^ < x. 
Further let iifix] q ) be the number ofp^ G P(zi, ..., z r ) such thatp^ < x and (p^ r \ q) — 1. 
Then for any A > 0 there exists B = B(A) > 0 such that 


(24) 


£ 

(J<X 1 / 2 (log x)~ B 


max 
(“>?)= 1 


7 i r (x-, q, a) 


1 

Hq) 


7 T r (x]q) 


<Cx(logcc) A , 


where the implied constant depends only on r and A. 


This is Theorem 22.3 of Friedlander and Iwaniec (!]. 

Note that the result reduces to the classical version of the Bombieri-Vinogradov The¬ 
orem when r — 1. In our applications we will sometimes need to replace the set defined 
in Q23j) with sets of the form 

(25) P(zi, ..., Z r , 2 / 1 , ...,y r ) = {p [r) = Pl-.-Pr ■yi>Pl>Z 1 , ..., y r >Pr> Z r } 

for z\, ...z r , yi, ..., y r > 2 where we allow yt = 00 in which case the condition y % > p, 
is automatically fulhlled. The lemma clearly holds for these sets as well since we may 
express a set of the form (1251) in terms of sets of the form (1231) . using the inclusion-exclusion 
principle. 

We will actually need the following slightly different version of the above lemma. 


Lemma 2. Let P(z±,z r , yi,y r ) be as in (l25j) and fix notation as in Lemma Q For 
each q > 1, let 


(26) 


R a {x) = max 

(«,<?)=1 


7 T r (x-,q,a) - T r (x-,q) 


Then for any A > 0 and k > 1 there exists B = B(A, k) > 0 such that 


(27) r(q) k R q {x) <^x(logx) A , 

5<x 1 / 2 (loga;) _s 

where the implied constant depends only on r , k and A. 


Proof. For q < x we have 
(28) 




Y J T ( C l) 2k Rq( X ) < Xllogx) 2 ^ . 

q<x 


so that 
(29) 
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On the other hand, Lemma [Tj shows that for any A' > 0 we will have 
(30) y ^R q (x) <C x(\ogx)~ A ' 

g<Q 

for Q = x 1//2 (loga:) _ ' B, ^^. The result then follows from (1291) and (1HUD by applying the 
Cauchy Schwarz inequality to (1271) . and choosing A 1 sufficiently large in terms of A and 
k. □ 


4. Proof of the Theorem 


4.1. Bounding S(A] £i). We now take 

= x 9i (i = 1, 2) and y = x e , 


where 9 1 , 9 2 and 9 are constants satisfying 

0 < 0 2 < 9\ < - and 9 2 < 9 < 1. 

o 

Thus 9 = v ^ 1 in the notation of §2. 

We will apply the lower bound vector sieve to the set 

W = {{p + 2 ,p + 6) : x 1/3 < p < x — 6}, 

taking z = (£ 1; £ 2 ) and X = tt(x). Since p > 6 we see that we cannot have d\ \ p + 2 and 
d 2 | p + 6 unless (di, d 2 ) = (di, 2) = (d 2 , 6) = 1. We therefore set 

h(d) = f WTy if ( d i> = (di, 2) = (d 2 ,6) = 1, 

1 0, otherwise, 

and 

#W d = h(d) 7r(x) + R{d), 

whence Lemma [2] gives 

(31) £ 'r(did 2 ) A \R(d)\ -C x(logx) A , 

d 

d\d2<x x l‘ 1 ~ e 


for any positive constant A. 

We now use the vector sieve lower bound from Proposition [1] According to (|3ip . the 
remainder sum can be bounded adequately when D = x 1//2 ~ 2e . The Euler factors in 
V(z 0 , h*)V\V 2 are 

1 = 4(l - |) 2 for p = 2, 

1 — i = K 1 — I ) 2 for p = 3 > 


i - 


V 


1 - 


p — 1 


2 


3p- 1 
(P - !) 3 
1 

‘ (^l ) 2 



1\ 

- I for 5 
PJ 



<P< Zq, 
Zo<P < 6 , 


and 






for i 2 <p< &, 


P(zo,h*)WP 2 ~ 

n 


whence 









with 


(32) 

and 


c = iII 1 


p> 3 


3p — 1 


(P - x ) £ 


vw=n< i -p _i ) 


=-7 


p<z 


log Z 


Note that C is the constant appearing in the Hardy-Littlewood conjectures for such prime 
tuples. We therefore obtain the lower bound 

(33) S(-4;6) > (C + o(l))ir(i)l/« 1 )rK 2 )/((29 1 )- 1 ,(29 2 )- 1 ). 

4.2. The terms S(A p ]^i). We may apply the upper bound vector sieve with the same 
set W as before, noting that h(pd \, d 2 ) = 4 >{p)~ l h{d) when d\ \ P(£i), d. 2 \P{^ 2 ) and 
p > £, i- This easily leads to the bound 

s(A;6) < (c + o(i))^4rK 1 )V'(&)F(« 1 ( P ),s 2 (p)) 

p — 1 

0 £ ( r(d 1 d 2 ) 4 | J R(pd 1 ,d 2 )| ) , 

did2<p _1 -D 1 + e ) 


with 


log D/p 

Si{p) = - i—(* = 1,2). 


log& 


ffence for any P < P' we have 


£ s(A.;«0 < (c + o(i)Mi) ( £ 

P<P<P' \P<p<P' 


F(si(p),s 2 (p)) 

p — 1 


^(6)^(6) 


+ 0 £ | ^ r(did 2 ) 4 |i?(p(ii,d 2 

ypdld2<n ,1+e 

The remainder sum is negligible, by Lemma [21 if D — x l ' 2 ~ 2e . It follows that 

(34) £ s(-/tp;6)<(c + °(i)M*M6)rte) f 

P<P<P' 

where we now have 

(35) 


tlogf 


log y/x/t , oN 


Alternatively we can use Selberg’s sieve as in subsection 13.41 For a given prime p we 
take W = to consist of the values (q + 2) (q + 6) where q runs over primes in the 
interval x< q < x — 6 such that p \ q + 2. For each prime r we use the residue classes 


(36) 


Q(r) 


"0, r = 2, 

{-2}, r = 3, 

<{-2,-6} 5 < r < 6, 

{-2}, 6 < r < 6 

,0j r>Ci- 












It is natural to take A" = tt[ x)/{jp — 1) and 

fO, r = 2, 


(37) 


Let z > £i and write 


Then 


h(r) = < 




f, r = 3 

5<r<6, 

0 , r> 6 - 

log z 


log 6 


i = 1,2 


y(z, /i)~ cv(6M6). 

If we write rf' for the corresponding remainder in (U 8 l) we will have 
S{A p] ^) = -S(W;z) 


< 


7r(x) 


{p-l)G(z) 


dkz 2 


“MlWi 


(C + o(l))Lh-B(i, 1 , S2 )V'(6)V'« 2 ) + ^3“<' i '|rf|. 

^ d< 2 2 


Moreover 

„(p) 


r'f \ <r(d) max 

(a,pd)=l 


{ 7 r(a; — 6 ; pd, a) — n(x 1 ^ 3 -,pd, a)} 


7r(x — 6) — ^(x 1 / 3 ) 


<K<?) 


Lemma [2] then shows that if we choose z = {\fx/P) l ^ 2 {\.ogx) c ’° with a suitably large 
constant Cq then 


E s(A;6)<(c + o(i)Mz) 


P< P <P' 


\P<P<P' 


B(a 1 (p),a 2 (p)) 

p — 1 


^( 6 )^( 6 ), 


with 


log 


di(t) = 


provided that £i < z. We then deduce that 


y/x 

t 


1/2 


log ^ 


r -P' 


B(d 1 (t),o 2 (t)) 


tlogf 


X/ s(A>; 6 ) < (C + o(i)Mx)V'K,)vZ6) 

P<P<P f 

Comparison with (l34|i now shows that 

(38) XI <(c + o(i) )*(x)v(h)v (&)!(»!, fe), 

with 


dt. 


■1/3 


(39) J( 6 i 1 , 6 I 2 )= / a 1 min<F( 




1 — 2a 1 — 2a, A -2a 1 - 2a, 


20 ! ’ 209 


-).B( 


IS ’ 49 2 


) > da. 
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4.3. Estimating N 0 via Chen’s role-reversal trick. The number N 0 is defined in 
terms of products P 1 P 2 P 2 £ A. However we can change our point of view and write 

N 0 = #{p + 2 e B : p prime}, 

where 

B = {P1P2P3 : 6 < Pi < x 1/3 <p 2 < P3, x 1/3 + 2 < P 1 P 2 P 3 < x - 4, 

(P1P2P3 + 4 , P( 6 )) = !}■ 

Thus instead of sieving numbers p + 2 and p + 6 we will sieve numbers P 1 P 2 P 3 — 2 and 
P 1 P 2 P 3 + 4. This is Chen’s reversal of roles. Following the approach of subsection 13.41 we 
let 

W = {P 1 P 2 P 3 : 6 < Pi < x 1/3 <P 2 <P3, and P 1 P 2 P 3 < x - 4} 
and for each prime r we define the set 0(r) by 


(40) 


0(r) = < 


0, r — 2, 

{2}, r = 3, 

{2,-4} 5 <r<6, 
{2}, 6 <r < z 

0, r > z. 


It follows that 

iV 0 < 5(W; 2 ) 

for any z between £ 2 and a? 1 / 4 , say. 

It is natural to take A" = and to choose the function h(r) to be given by fl37|) as 
before, except that now h(r) = l/(r — 1) for £ 2 < r < z. With this definition we will have 

V(z,h) ~ CV(z) H(6) - 

where si = (log z)/ (log £ 1 ). Moreover if we define r d via (fTST) then we can use Lemma [2] 
with z = x 1 A(\ogx)~ B t 2 to show that 

(41) y; 3 aj(d) |r d | -C ^(loga;)" 71 . 

d<z 2 

In order to do this we replace W by the set 

Wo = {P 1 P 2 P 3 ■ £1 < Pi < x l/3 < P2,P3, and P 1 P 2 P 3 < x - 4}, 
to which Lemma [2] applies directly. We should also note that 

7 T r (x -4 ;q) = #W 0 + O(x^) = #W 0 + 0 (x( logx)"^ 1 ), 

on allowing for possible common factors of q and PiP 2 P 3 - The error term here is certainly 
small enough for f|41|) . 

We also need to estimate #W. We find that 

f-x 1 / 3 Ax/vj 1 / 2 


#W 


E 


x 


dudv 


P 1 P 2 log - L - 

£l<Pl<W 3 ^ ^ 6 

x 1 / 3 <p2<(x/pi l 1 / 2 


X 


(42) 
where 

(43) 


L(s) = 


'1/3 


a JxV 3 («log «) (u log v) log ( 

’k{x)L{6^[ 1 ) 

dad/3 


1- 

2 


h/s J 1/3 a/?(l-a-/ 3 )’ 

14 








by the change of variables u = x a and v = x@. We therefore conclude that 

(44) N 0 <(C + o(l))7r(a;)V r (ei)^(6)^(^r 1 )40i J B(l, (4 6 2 y l ). 

It is possible as well to apply the vector sieve here, but the bound (1441) is always 
superior for our application. 


4.4. The weighted sieve terms. We now turn our attention to 

\ 


(45) 


T'-= E w - + 

&<q< y ' 


r< E ^ + -#vf > , 

&<q<y ' ' 


V (1) = {n + 4 : n € A and (n, P(x 1,/4 )) = 1 }, 


We write 

(46) 

where 

(47) 
and 

(48) V (2) = {n + 4 : n € A and n = P 1 P 2 , £1 < Pi < x 1 / 4 < p 2 }. 

Note that £>-') yU) an d every element of B ^ is in with the exception of those 
n + 4 with n £ A, n = p!p 2 with pi,p 2 > x 1//4 , and those are counted in V (1 h 

We begin by examining ftVq' 1 . We will use the Selberg sieve as in subsection 13.41 To be 
precise, we take 

W = {p £ (x 1 / 3 , x — 6 ] : q \ p + 6 } 

and 

0 , r = 2 or q, 

{-2}, r — 3, 

{-2,-6} 5 <r<6, 

k I -2 }, ii<r<z,r^q. 


fl(r) = < 

For some z £ [^x 1 / 4 ], we choose 


X = 


9-1 


: x 1 / 3 < p < x — 6 } 


and use h given by 

(49) 


h(r) = 5. 


" 0 , r = 2 or g, 

1 r = 3 
= 1 , 5 < r < £ 2 , 

k yrp &<r<z, r^q, 


whence 

y(^,/i)~(7y(z)y(6). 

In estimating the individual terms in 

E #vf>. 


P<q<P' 
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Lemma[2]will allow us to use z = x 1 ^ A q 1 / 2 (loga:) B / 2 , provided that z > £ 2 - We therefore 
conclude that 


£ #vp < (C + o(l))ir(i) ]T -^V(z)Vfa)B 

P<q<P' P<q<P' " 


(1 logz 

\ ’ log 6 


Since logz ~ log(x 1//4 g 4 / 2 ) we have 

V(z) ~ H(£i) 

and we deduce that 


log 6 


log^VV 1 / 2 ) ’ 


5 ^ rorfV' 1 ) < (C + o(l)P(x)V'R 1 )V'R 2 )E, 

&<q<y 


with 

V = V l°g£i p A log x 1/4 g~ 1/2 \ 

? - 1 lo g(x 1/4 g- 1/2 ) V’ log 6 / 

In order to ensure that z > £ 2 , we impose the condition that 202 + 0 < 1/2. Bearing in 
mind the definition (J4]) of the weights w q = 1 — we apply the Prime Number Theorem 
to see that 


E ~ 


r\_/ 


f y log£i A _ logt\ / log(a: 1 / 4 f 1/2 ) \ 
& log(ad/ 4 g-i/ 2 ) V log v) V log 6 / 



40! 9 — a^ 
1 — 2 a ad 



l-2a\ 
40 2 ) 


da. 


dt 

tlogt 


For notational convenience, let 


(50) 

so that 

(51) 


J(01,02,0) 



40! fl-q 
1 — 2 a ad 



1 — 2cA 

402 ) 


da , 


£ m ,#vw < (c' + o(i))J(e 1 ,»2,»)ir(i)y(6)r(6), 

&<q<y 


if 202 + 0 < 1/2. 

As in the previous section, here too we could have used the vector sieve upper bound, 
but again the Selberg method is superior. 


/ 0'S ■ 

We now examine # Vq ; . Again, we will use the Selberg sieve as in subsection 13.41 but 
our approach to Vq 1 ' 1 and our approach to Vq 2 ' 1 differ. In our treatment of Vq 1 ' ) , we took W 
to be a set of primes p and used the sieve to handle the conditions that (p+ 2, P(x 1//4 )) = 1 
and (p + 6, P(£ 2 )) = 1. Here, we will take W to be a set of numbers of the form n = pip -2 
for pi and P 2 prime, and use the sieve to handle the condition that n — 2 is prime and 
(n + 4, P(£ 2 )) = 1 
To be precise, we take 

W = {P 1 P 2 e (ic 1/3 + 2, x - 4] : q | pip 2 + 4, £1 < p± < x 1/A < p 2 } 
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and 


fl(r) = < 


0, r = 2 or q, 

{2}, r = 3, 

{2,-4} 5<r<£ 2 , 

{2}, 6 < r < z, r ± q. 


We choose 


X = -#{piP 2 e (x 1/3 + 2, x - 4] : 6 <P\ < ^ 1/4 < M 


and use h given by (1491) as before. Recall that 

V{z,h)~CV(z)V{£ 2 ). 


We have 


X 


q -1 

1 


E 


x 


gl^pi^z 1 / 4 Pi 1°§ 

/*T 4 / 4 


x dt 


g-l./ 6 flog (f) log t 




W 4 du 


q -1 J(h M 1 - «) 

1 - 0 ! 


q-1 


n(x) log 


30i 


In estimating 


E #b 2) . 

P<q<P' 

Lemma [2] will again allow us to use z = x 1 ^q~ 1 ^ 2 {logx)~ B ^ 2 , provided that z > £ 2 . We 
therefore conclude that 


Y < { C + o(l))7r(a;) lo S 

P<q<P' 


1 - 0 ! 

30i 




P<q<P‘ 


log £ 2 / 


Continuing as in the previous section, we have 


l-0i 

30i 


J (0i, 0 2 , 0)tt(x)V (£i)V(£ 2 ), 


(52) J] w q #V^<(C + o( l))log 

&<q<y 

where J is as defined in (19UD . As in the previous section, here too we could have used the 
vector sieve upper bound, but again the Selberg method is superior. 


4.5. Summary. Putting 


and (HU) into flU and by (1911) , (1491) and (1941) , we have 


(53) Si - \ Y. + >C7rMV'K 1 )CK 2 )ff(9 1 ,9 2 ,9,A)(l + 0 (l)), 

£2 <p<y ' 

where 


(54) 


i/(0i,0 2 ,0,A) = /((20i)- 1 ,(20 2 )- 1 )- 


- A 


1 + g lo § 


1-0A 

30i J 


i/(9 1 ,9 2 )-2L(9 1 - 1 )9 1 S(l,(40 2 )- 1 ) 

J(9i.9 2 .9). 
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We chose 6 \ = 1/11, 0 2 = 1/410, 0 = 1/30 and A = 0.0145. Recall that / is defined as 
a snpremnm - using Matlab to conduct a rough search for the sup, we compute that 

/((20i) _1 , (202)" 1 ) > 0.9992523... 


When calculating 1 ( 61 , 62 ), the quantity 


min 



l-2a 

26 x 


1 — 2 a 

2 d 2 


M( 


1 — 2a 

40i 


1 - 2a, 
4 do ' 


appears, arising from use of both the vector sieve and our version of Selberg’s sieve. 
For our values of and d 2 , F(^p , is smaller for small values of a, while 

1 becomes a better choice at around a = 0.26. Again, a Matlab compu¬ 

tation with a rough optimization of the value of F (, 1=^), which was defined as an 
inf, gives that 

1 ( 61 , 62 ) < 1.5630111... 


It is simple to calculate that 


Further, using that 


L(d^) = 0.5477550... 


(55) / p(u)du = / up(u)du = e 7 , 

Jo Jo 

we have that 

poo pl—y/v poo pl—y/v 

R(l,n) _1 = e -27 / p(y) / ldxdy- / p(y) / ldxdy. 
00 00 01 ) 00 

On the other hand, using the crude upper bound p(n) < we have that 



bJ!’ 


while 



From this, it follows that 

B(l, (40 2 ) _1 ) = 1.7986199... 


Substituting the above estimate for B(l,v) into the definition of J( 61 , 62 , 6 ), we find 
that 

J( 6 i ,6 2 , 6 ) = 1.1235270... 

We find that H( 6 i, d 2 , 6 , A) > 0 for A < 0.0214. From (J5]) this gives a bound for r of 
the form r < 1/0 + 1/A < 77, giving the result that there are infinitely many primes p 
such that p + 2 has at most 2 prime factors and p + 6 has at most 76 prime factors. 
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5. The average of multiplicative functions appearing in Selberg’s sieve 
We end by proving Proposition [21 For i G {1, 2}, let 


Xiin) = 


1, if p\n p < Zi, 

0, otherwise. 

Recall that g is the multiplicative function supported on squarefree numbers defined by 

/ \ _ Hp) 

9P 1 -h(p) m 

We further define the multiplicative functions k and j by 

np, 2 (n)g(n) = (xi * k)(n) = (xi * X 2 *j)(n), 
for all natural numbers n, so that 


E 

n> 1 


np 2 {n)g{n) 


n a 


Xi ( n ) | ( \ ^ H n ) 


. n>l 


i n>l 


E^)(E 


X 2 (n) 


K n> 1 


v n>l 


n b 


E 

^n>l 


J(n) 


n a 


The Dirichlet series above clearly converge for Re s > 1. Moreover we see that 

E^=nf i +^)nf i -5)nf i -= 


r> 1 


p s 


P<z 1 


p- 


p<Z2 


p s 


Thus 


j(p) -C p 1 , and j(p e ) « 1 (e > 2). 

Similarly we find that 

( 1 + 0 (p~ 1 ), p< z 2 , 

k(p) = < 0(p~ 1 ), z 2 <p < z 1: and k(p e ) Cl (e> 2). 

[ 0, p > z i, 

These estimates suffice to show that 


(56) £ |j(r)|« e iU 2+ ' 

R<r<2R 

for any fixed £ > 0, and 

(57) E \k(m)\ <C M. 

M <m<2M 

In order to study G(z) wc will first examine the average of dp 2 (d)g(d), whose behaviour 
resembles that of xi * X 2 - We intend to take advantage of the fact that x.i and X 2 
are indicator functions of smooth numbers, and the computation of their averages are 
standard results. We have 

(58) n/x 2 (n)g(n ) = k(m)V(—; *i), 

z — ' Z — J m 

n<x m<x 


where T (x\ y) is the number of y-smooth numbers below x. It follows from a result of de 
Bruijn [2] that 












uniformly for 1 < y < x, where p is Dickman’s function defined as in the statement of 
Proposition [21 Continuing from (1581) . we have 


2/ w \ K m ) f^ogx/m 

> up (n)g(n) = x > - p — - 

' m \ log z\ 


n<x 


m<x 


+ o(xJ2 

\ m< x 


|fc(m)| 


m\og{2x/m) 


(59) 

upon observing that 


= x 


E 

m<x 


k(m) /log x/m 
m ^ \ log^i 


+ 0(xlog logx), 


E 

m<x 


|fc(m)| 


rn \og(2x/m) 


log log x 


by dSZD- A similar calculation yields 

Y fc ( m ) = 


^2 


m<y 


r<y 

= V 


(60) 

after noting that 


= V 


r<y \ & - / \ r < y 

EhihW +OWog2srl) , 

z -^ r \ log 2 ( 2 / 


r \og(2y/r) 




r<y 


r log(2 y/r) 


by (l56j) . Another application of (1561) shows that 

y^j(r) ( logy/r \ = y, j(r) /logy/r\ +Q 1/8 

r \ log Z2 J r V log *2 J 

Since p'(t) <C t~ l for t > 0 we have p () = P (ioS) + ^ (iojyfii) f° r r — y/Vi w l ience 
two more applications of (1551) yield 


j(r) / log y/r 
^ r ^ \ log 


r<y/y 


= p 


= p 


log y 
log £2 


ETHc^-E 


r<^y 


\j( r )\ logr 


r<y/y 


(^)E —+ 0 ((log2 j /) 
\l0g-2/^ T 


-1\ 


It therefore follows from (jUU]) that 


Y K m ) = C 0 yp + 0{y(\og2y) 1 ), 


m<y 


where 


(61) C 0 = Y = V(zi)V(z 2 )V{z, h) 1 ~ e 27 (log^i) ^logz-j) V(^, h) 


r=l 


-1 


Note here that Co <C 1, since Co can be written as a product of Euler factors each of 
which is 1 + 0 (p~ 2 ). 
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We may now insert the above formula into (1591) , using partial summation to deduce 
that 


^ j np 2 (n)g(n) = 


n<x 


m<x 


= CqX 


k(m ) /log x/m 
m ^ \ logzi 

log t 


r 1 d 

/, t dt 


The integral is 

f x 1 


:P 


logt 


1 t V lo g^2 


P 


log x/t 
log Zi 


tp 


dt + 


log 2:2 


+ 0{x log logo:) 
log x/t 


P 


log Zi 


1 , f log t 


p 


<1 t log Z 2 Vl°g-2 


p 


dt + 0(x log log x). 
log x/t' 


log 21 


dt. 


However p'(s ) = 0 for 0 < s < 1 and p'(s ) -C s 1 otherwise. Thus the second integral 
above is O (log log x) so that 


^2,np 2 {n)g{n) = C 0 x / p 


n<x 


log t 
log z 2 


p 


log x/t\ dt 

-h (A (x log log x). 


log 2:1 J t 


A further summation by parts now shows that 


n<z 


= C n 


P 


logt ^ (\ogz/t\ dt 
log z 2 


P 


1 


log Z! J t + J 1 X J 1 


P 


log t 
log Z 2 


P 


log x/t\ dt 

—dx 


log Z\ ) t 


0 ((log*) (log logs))- 


The hrst integral above is O(log^), which may be absorbed into the error term, while 
the second is 



(logZi)(logz 2 ) 

'{(u’l,Ul2):Uli,UJ2>0, Wl/S\+W2/S2<X\ 

The proposition now follows from (j6Tj) . 


p(w 1 )p(w 2 )dw 1 dw 2 . 
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